142 PART 4 Comparing Groups
Grasping Why Different Situations
Need Different Tests
You may wonder why there are so many tests for such a simple task as comparing
averages. Well, “comparing averages” doesn’t refer to a specific situation. It’s a
broad term that can apply to different situations where you are trying to compare
averages. These situations can differ from each other on the basis of these and
other factors, which are listed here in order of most to least common:»
» Within or between: You could be testing differences within groups or
differences between groups.»
» Number of time points: You could be testing differences occurring at one
point or over a number of time points.»
» Number of groups: You could be testing differences between two groups or
between three or more groups.»
» Distribution of outcome: Your outcome measurement could follow the
normal distribution or some other distribution (see Chapter 3).»
» Variation: You could be testing the differences in variation or spread across
groups (see Chapter 3).
These different factors can occur in any and all combinations, so there are a lot of
potential scenarios. In the following sections, we review situations you may fre-
quently encounter when analyzing biological data, and advise you as to how to
select the most appropriate testing approach given the situation.
Comparing the mean of a group of numbers
to a hypothesized value
Sometimes you have a measurement from the literature (called a historical control)
that provides a hypothesized value of your measurement, and you want to statis-
tically compare the average of a group to this mean. This situation is common
when you are comparing a value that was calculated based on statistical norms
derived based on the population (such as the IQ test, where 100 has been scaled to
be the population mean).
Typically, comparing a group mean to a historical control warrants using the one-
group Student t test that we describe in the later section “Surveying Student t tests.”
For data that are not normally distributed, the Wilcoxon Signed-Ranks (WSR) test
can be used instead, although it is not used often so we do not cover it in this
chapter. (If you need a review on what normally distributed means, see Chapter 3.)